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I. IN THE SPECIFICATION 

Please replace the paragraphs at page 4, lines 1 1 - 17 with the following replacement 
paragraphs: 



-Figure 1 illustrates the 1993 version of the International Phonetic Alphabet. 

Figure 2 illustrates the method associated with the preferred embodiment of the present 
invention for determining a compact model to transcribe the Arabic language acoustically based 
on a well-defined basic phonetic study. 

Figure 3 illustrates in further detail the data extraction step of Figure 2. 

Figure 4 illustrates the composition of the maximal set described in the method of Figure 

2. 

Figure 5 illustrates the various kinds of phonemes. 

Figure 6 illustrations the reduction of maximal set for the text to speech system and the 
automatic speech recognition sets in a preferred embodiment of the invention.- 



Please replace the paragraph at page 5, line 17 - page 6, line 2 with the following 
replacement paragraph: 

—It should be noted that the International Phonetic Alphabet (IP A) was used in 
conjunction with this invention. The IPA, as defined by the International Phonetic Association 
(http://www.arts.gla.ac.uk/EPA/ipa.html) is a standard set of symbols for transcribing the sounds 
of spoken languages. The above mentioned website provides for a full chart of IP A symbols as 
reproduced in Figure 1 . Furthermore, charts for consonants, vowels, tones and accents, 
suprasegmentals, diacritics and other symbols are also provided. The last version of the IPA 
dates to 1993, as shown in Figure 1.-- 
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Please replace the paragraphs at page 8, line 12 - page 10, line 13 with the following 
replacement paragraph: 



? -Figure 2 illustrates the method 100 associated with the preferred embodiment of the 
present invention for determining a compact model to transcribe the Arabic language 
acoustically (based on a well-defined basic phonetic study). First, a language for which a 
compact model is to be developed is identified 102. Next, information regarding the identified 
language is extracted or collected 104. Data extraction can be accomplished via a variety means 
including, but not limited to: extracting data regarding the Arabic language via a network (such 
as the Internet, Local Area Network (LAN), Wide Area Network (WAN) or database (local or 
remote). Next, from the extracted data, a list is created where the phonological and phonetic 
units are defined 106. As a next step, the variations in the Arabic language are identified 108. 
For example, variations in classical Arabic, Modern Standard Arabic (MSA), and colloquial 
Arabic are identified. Next, a maximal set is created that contains all phonemes, allophones, and 
transliteration symbols associated with the Arabic language 110. Transliteration refers to the 
process of representing or spelling a word (in a first language) in the characters of another 
alphabet (second language). Lastly, the maximal set is reduced 1 12 to provide for a compact set 
to transcribe the Arabic language acoustically. The details of the reduction step are explained in 
detail in the following sections. 

The data extraction step of Figure 2 (102) is illustrated in further detail in Figure 3. With 
the extracted data, terminological problems are identified 202. Certain terms that have been used 
by several phonological linguists in their attempt to define and describe the nature of various 
Arabic sounds have proved invalid; i.e. whereas few linguists may include phonemes like 
/F7/,/R7/, and 1X1 into the category of Emphatics , others may include them in the category of 
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pharyngeals. As a result of this non-final consensus, the most appropriate category depending 

upon their influence on the neighboring vowels was selected. Next, transcription problems 

associated with the language in question (e.g., Arabic) are identified 204. In contrast to what the 

IPA exhibits in using special symbols (ASCII characters) which might cause technical problems 

if used in the present system; the transcription set was limited to include the characters which can 

be typed easily on the keyboard. Furthermore, phonological and phonetic units were extracted or 

collected 206 and a feature set was established based on this information 208. Next, a 

representative symbol for the transcription alphabet is selected 210 and a structured source is 

built 212. Our structured source consists of Phonemes, which are divided into three main units: 

Consonants, Vowels and Semi-Vowels. The unit "Consonants" includes a variety of Allophones 

and Geminations. Aallophones may have their own gemination variety. The unit "Vowels" has 

a variety of allophones only, while the unit "Semi-vowel" has just gemination variety. The 

features of these units are determined according to three conditions: Place, manner of articulation 

in addition to the nature of the sound being voiced or voiceless. 

Figure 4 illustrates the composition of the maximal set described in step 1 10 of Figure 2. 

Maximal set 300 comprises (but is not limited to): phonemes 302, allophones 304, a set of rules 

governing the selection of allophones 306, a set of examples 308, and the transliteration symbols 

310. It should be noted that although the preferred language of this application is Arabic, one 

skilled in the art could extend the present invention to cover other similar languages. A detailed 

description of the Arabic phonetic study as per the present invention is given below.-- 
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Please replace the paragraphs at page 12, line 10 - page 13, line 2 with the following 



replacement paragraphs: 



--The reduction of maximal set for the TTS and the ASR phonetic sets are described in 



detail in Figure 6. 

TTS Phonetic Set 

In the phonetic set for the TTS system, all the phonemes and allophones with which any 
given text message can be conveyed is found. For example, i) all the allophones for the vowels 
are identified; ii) allophones that represent any borrowed word in Arabic are identified, and iii) in 
the case of gemination, add symbols to represent the phoneme when it is geminated. Thus, 
geminated phonemes represented by doubling the original symbol, are represented by a new 
symbol.-- 

Please replace the paragraphs at page 21, line 1 - page 28, line 9 with the following 

replacement paragraphs: ' 

-TABLE 1 Arabic Phonetic Alphabet Table 



The Table Supports: 




1) Well Educated Pronunciation (Used in Text To Speech) 



2) In the Sound Features Field 

i. (+) = Voiced 

ii. (-) = Voiceless 
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TABLE 2 



wegd 




waqt 




WA6V 





(Different symbols that represent short Fatha) 



NE.OIm 




Nl.qld 




N2.6Ig 





(Different symbols that represent long Fatha) 
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